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Abstract. This article was written for the Logic in Computer 
Science column in the February 2015 issue of the Bulletin of the 
European Association for Theoretical Computer Science. The in¬ 
tended audience is general computer science audience. 

The uncertainty principle asserts a limit to the precision with 
which position x and momentum p of a particle can be known si¬ 
multaneously. You may know the probability distributions of x and 
p individually but the joint distribution makes no physical sense. 
Yet Wigner exhibited such a joint distribution f{x,p). There was, 
however, a little trouble with it: some of its values were negative. 
Nevertheless Wigner’s discovery attracted attention and found ap¬ 
plications. There are other joint distribution, all with negative 
values, which produce the correct marginal distributions of x and 
p. But only Wigner’s distribution produces the correct marginal 
distributions for all linear combinations of position and momen¬ 
tum. We offer a simple proof of the uniqueness and discuss related 
issues. 


1. Introduction 

“Trying to think of negative probabilities,” wrote Richard Feynman, 
“gave me a cultural shock at hrst” [6] . Yet quantum physicists tolerate 
negative probabilities. Feynman himself studied, in the cited paper, a 
probabilistic trial with four outcomes with probabilities 0.6, —0.1, 0.3 
and 0.2. 

We were puzzled. The standard interpretation of probabilities de- 
hnes the probability of an event as the limit of its relative frequency 
in a large number of trials. “The mathematical theory of probability 
gains practical value and an intuitive meaning in connection with real 
or conceptual experiments” 0 §i.i|. Negative probabilities are obvi¬ 
ously inconsistent with the frequentist interpretation. Of course, that 
interpretation comes with a tacit assumption that every outcome is 
observable. In quantum physics some outcomes may be unobservable. 
This weakens the frequentist argument against negative probabilities 
but does not shed much light on the meaning of negative probabilities. 
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In the discrete case, a probabilistic trial can be given just by a set 
of outcomes and a probability function that assigns nonnegative reals 
to outcomes. One can generalize the notion of probabilistic trial by 
allowing negative values of the probabilistic function. Feynman draws 
an analogy between this generalization and the generalization from 
positive numbers, say of apples, to integers. But a negative number of 
apples may be naturally interpreted as the number of apples owed to 
other parties. We don’t know any remotely natural interpretation of 
negative probabilities. 

We attempted to have a closer look on what goes on. 

Heisenberg’s uncertainty principle asserts a limit to the precision 
with which position x and momentum p of a particle can be known 
simultaneously: (Jx<Jp > h/2 where (Tx,crp are the standard deviations 
and h is the (reduced) Planck constant. You may know the probability 
distributions (or the density function or the probability function; we 
will use the three terms as synonyms) of x and p individually but the 
joint probability distribution with these marginal distributions of x and 
p makes no physical sens^. Does it make mathematical sense? More 
exactly, does there exist a joint distribution with the given marginal 
distributions of x and p. 

In 1932, Eugene Wigner exhibited such a joint distribution [20] . 
There was, however, a little trouble with Wigner’s function. Some 
of its values were negative. The function, Wigner admits, “cannot be 
really interpreted as the simultaneous probability for coordinates and 
momenta.” But this, he continues, “must not hinder the use of it in cal¬ 
culations as an auxiliary function which obeys many relations we would 
expect from such a probability” [20]. Probabilistic functions that can 
take negative values became known as quasi-probability distributions. 

Richard Feynman described a specific quasi-probability distribution 
for discrete quantities, two components of a particle’s spin [6]. The 
uncertainty principle implies that these two quantities can’t have def¬ 
inite values simultaneously. So it seems plausible that an attempt 
to assign joint probabilities would again, as in Wigner’s case, lead to 
something strange — like negative probabilities. “Trying to think of 
negative probabilities gave me a cultural shock at first ... It is usual to 
suppose that, since the probabilities of events must be positive, a the¬ 
ory which gives negative numbers for such quantities must be absurd. 
I should show here how negative probabilities might be interpreted” 
[6]. His attitude toward negative probabilities echoes that of Wigner: 


^For general information about joint probability distributions and their marginal 
distributions see [51 §IX.l] 
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a quasi-probability distribution may be used to simplify intermediate 
computations. The meaning of negative probabilities remains unclear. 
Those intermediate computations may not have any physical sense. 
But if the hnal results make physical sense and can be tested then the 
use of a quasi-probability is justihed. 

It bothered us that both Wigner and Feynman apparently pull their 
quasi-probability distributions from thin air. In particular, Wigner 
writes that his function “was found by L. Szilard and the present au¬ 
thor some years ago for another purpose” [20], but he doesn’t give 
a reference, and he doesn’t give even a hint about what that other 
purpose was. He also says that there are lots of other functions that 
would serve as well, but none without negative values. He adds that 
his function “seemed the simplest.” 

We investigated the matter and made some progress. We found a 
characterization of Wigner’s function that might be considered objec¬ 
tive. 

Proposition 1 (Wigner Uniqueness). Wigner’s function is the unique 
quasi-distribution on the phase space that yields the correct marginal 
distributions not only for position and momentum but for all their linear 
combinations. 

Quisan0: Wait, I don’t understand the proposition. 
Wigner’s function is not a true distribution, so the no¬ 
tion of marginal distribution of Wigner’s function isn’t 
dehned. Also, what does it mean for a marginal to be 
correct? 

Author^: The standard dehnition of marginals works 
also for quasi-probability distributions. A marginal dis¬ 
tribution is correct if it coincides with the prediction of 
quantum mechanics. We’ll return to these issues in 1|3] 
and dH respectively. 

Q: To form a linear combination ax bp of position x 
and momentum p you add x, which has units of length 
like centimeters, and p, which has momentum units like 
gram centimeters per second. That makes no sense. 

A: We are adding ax and bp. Take a to be a momentum 
and fe to be a length; then both ax and bp have units 
of action (like gram centimeters squared per second), so 

^Readers of this column may remember Quisani, an inquisitive former student 
of the second author. 

^speaking one at a time 
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they can be added. If you want to make ax + bp a pure 
number, divide by h. 

Q: Finally, is it obvious that Wigner’s quasi-distribution 
is not determined already by the correct marginal dis¬ 
tributions for just the position and momentum, without 
taking into account other linear combinations? 

A: This is obvious. There are modihcations of Wigner’s 
quasi-distribution that still give the correct marginal dis¬ 
tributions for position and momentum. For an easy ex¬ 
ample, choose a rectangle R centered at the origin in 
the (x,p) phase plane and modify Wigner’s f{x,p) by 
adding a constant c (resp. subtracting c) when {x,p) G R 
and the signs of x and p are the same (resp. dif¬ 
ferent). For a smoother modihcation, you could add 
cxpexp{—ax^ — bp^) where a, 6, c are positive constants 
(of appropriate dimensions). 

The idea to consider linear combinations of position and momentum 
came from Wigner’s paper [20] where he mentions that projections to 
such linear combinations preserve the expectations. In fact, the projec¬ 
tions give rise to the correct marginals. This led us to the proposition. 

In the case of Feynman’s quasi-distribution mentioned above, one 
can’t use linear combinations of those two spin components to charac¬ 
terize the distribution. Nor is there a characterization using the spin 
component in yet another direction. Furthermore, if we only require the 
correct marginal distributions for the x and z spins, then there are gen¬ 
uine, nonnegative joint probability distributions with those marginals. 

Our investigation was supplemented by digging into the literature 
and talking to our colleagues, especially Nathan Wiebe. That brought 
us to “Quantum Mechanics in Phase Spaces: An Overview with Se¬ 
lected Papers” [22]. It turned out that there was another, much ear¬ 
lier, approach to characterizing the Wigner quasi-probability distri¬ 
bution. The main ingredient for that earlier approach is a proposal 
by Hermann Weyl m §IV.14] for associating Hermitian operators on 
to well-behaved functions g{x,p) of position and momentum. Jose 
Enrique Moyal used Weyl’s correspondence to characterize Wigner’s 
quasi-distribution in terms of only the expectation values but for a 
wider class of functions rather than the marginal distributions for just 
the linear functions of position and momentum im. There is a trade¬ 
off here. The class of functions is wider but the feature to match is 


narrower. 
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George Baker proved that any quasi-distribution on the position- 
momentum phase-space, satisfying his “quasi-probability distributional 
formulation of quantum mechanics,” is the Wigner function [2]. The 
problem of an objective characterization of Wigner’s function also at¬ 
tracted the attention of Wigner himself [211112] • 

The volume [22] does not contain “our” characterization of Wigner’s 
function but it is known and due to Jacqueline and Pierre Bertrand 
[3]. They found an astute name for the approach: tomographic. The 
tomographic approach gives an additional conhrmation of the fact that 
behavior of our quantum system is not classical. The approach can be 
used to establish that the behavior of some other quantum systems is 
not classical. It has indeed been used that way in quantum optics; see 
[laila] for example. 

Still, in our judgment, our proof of the uniqueness of Wigner’s func¬ 
tion is simpler and more direct than any other in the literature, and so 
we present it in jlH In section j|5]we establish Moyal’s characterization 
of Wigner’s function. 

Uni contains a cursory discussion related to Feynman’s four-outcome 
quasi-distribution. All our observations on that issue happened to be 
known as well, but we have yet to research the history of the negative 
probabilities in the discrete case. We intend to address the discrete 
case elsewhere. 

Q: So what is the meaning of negative probability? 

A: We don’t know. 

Q: The use of negative probabilities to validate the quan¬ 
tum character of a quantum system reminds me proofs 
by contradiction. Assume that the behavior is classical, 
produce a unique joint distribution, prove the existence 
of negative values and establish a contradiction. If this 
is the only use of negative probabilities then there is no 
need to interpret them semantically. 

A: There are some attempts to use negative probabilities 
as a measure of “quantumness” [ninB]. We think that 
the jury is still out. 

Q: I have yet another question. Recently, in this very 
column, Samson Abramsky wrote about contextuality 
which is another manifestation of non-classical behavior 
of quantum systems [1] . I wonder what is the relation, if 
any, between negative probabilities and contextuality. 
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A: The discussion of that relation is beyond the scope of 
this paper. But please have a look at Robert Spekkens’s 
article [16] with a rather telling title “Negativity and 
contextuality are equivalent notions of nonclassicality.” 

Acknowledgment. Many thanks to Nathan Wiebe who was our guide 
to the literature and the state of art on quasi-probabilities. 


2. Preliminaries 


We tried to make the paper as accessible as possible; hence this 
section. We still assume some familiarity with mathematical analysis. 
By default in this paper integrals are from —oo to -|-oo. The baby 
quantum theory that we use is covered in §3 of book [S] titled “A hrst 
approach to quantum mechanics,”. 

2.1. Fourier transform. The forward Fourier transform sends a func¬ 
tion /(x) to 



and the inverse Fourier transform sends a function g{^) to 



Mathematically x and ^ are real variables. In applications, the dimen¬ 
sion of ^ is the inverse of that of x so that is a pure number. 

The forward and inverse Fourier transforms are dehned also for func¬ 
tions of several variables. In particular. 



Q: What about the convergence of the integrals? Are 
you going to ignore such details? 

A: Yes, we are going to ignore such details. But Fourier 
transforms are used, with full mathematical rigor, even 
in some situations where the integrals don’t converge. 

Q: I do not understand this. 

A: The idea is to hrst dehne the Fourier transform as an 
operator on nice functions in L^(M), for which the inte¬ 
grals clearly converge. Informally a function /(x) is nice 
if it and its derivatives /'(x), /"(x), /"'(x),... approach 
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zero very rapidly as x — )■ cxd. The Fourier transform is an 
isometry on these nice functions, and the nice functions 
are dense in so the isometry extends to all of L^. 

Details can be found in books on real analysis, like [9] 
and [H]; alternatively, see [H Appendix A.3.2]. 


2.2. Dirac’s delta function. Dirac’s ^-function is a generalized func¬ 
tion such that for any nice function /, 

J fix)6{x)dx = /(O). 

It follows that 

J f {x)5{x — a)dx = J f{x + a)6{x)dx = f{a). 

Some divergent integrals, e.g. J e^^^dt, can be seen as generalized 
functions in that sense. In fact, as generalized functions, 

[ e^^^dt = 27r5{x). 


Indeed, 


dxf{x) J e''*^dt = V^ J dt—^= J f{x)e^^^dx 


= [ f{t) dt 


= 271- 




f{t)e * dt = 27r/(0). 


Q: Are these nice functions the same as the nice functions 
mentioned earlier. 

A: Yes, they are. 


2.3. Exponential operators. The exponential of an operator O 
over a topological vector space is the operator 

= y2 — = I + O + + ... 

^ k\ 2 6 

k=0 

If {X'ip){x) = X ■ Tp^x) then (e^pj^x)) = because 
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If D is the derivative operator then 
= '0(a; + a). Indeed, 



which is the Taylor series of + a) around point x. (Think of a as 
Ax.) 


Q: What functions / are you talking about? The Taylor 
series expansion of / suggests that / is analytic, that is 
real-analytic. 

A: Our intention is that / ranges over L^. By the proof 
above, is a shift f{x) i-A f{x + a) on analytic func¬ 
tions. In particular, is a shift on Gaussian functions 



But Gaussian functions span a dense subspace of L^(M), 
and there is a unique continuous extension of to L^, 
namely the shift /(x) i-A f{x + a). 

Q: The exponential has got to be a partial operator 
in general. 

A: Yes, e^{x) is dehned whenever the oprators are 
dehned at x, and the series ^ converges. 


3. Joint-to-Marginal Lemma 


Let f{x,p) be an ordinary probability distribution or a quasi¬ 
distribution on For any z = ax + bp where a, b are not both zero, 
the marginal distribution g{z) of z can be dehned thus: 
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Here’s a justification in the case b ^ 0. We have 


P= ax), 

dp = -^{dz — a dx), 



We are relying here on the formalism of differential 2-forms [7] for area 
elements, so that dx dz really means dx A dz and we have used that 
dxAdx = 0. The use of differential forms makes computations like this 
easier, and it fits well with physics, e.g., with Maxwell’s equations and 
with general relativity. One could, however, avoid differential forms 
here and get the same result by considering the Jacobian determinant 
of the change of variables. 

For any real u < v, the probability that u < z < v should be 



Since the first and last expressions coincide for all u < n, we have 



Lemma 2 (J2M). For any a, b not both zero, the following statements 
are equivalent. 


(1) qiz) is the marginal distribution of z = ax + bp. 

( 2 ) m = V^-fiaC,bC). 
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Proof. To prove (1)^(2), suppose (1) and compare the forward Fourier 
transforms of g and /: 

^(0 = 


/(^,h) = 

We have g{C) = V^f{aC,bC). 

To prove (2)—suppose (2) and use the implication (1)^(2). If 
h is the marginal distribution oi z = ax + by then 

h{C) = ■ /(aC, bC) = g{C), 

and therefore g = h. □ 

Corollary 3. For any real a, jS not both zero, f{a, (3) = g{Q where 
g{z) is the marginal distribution for the linear combination z = ax + bp 
such that a = a(, (3 = h( for some (. 

4. WiGNER UNIQUENESS 

The purpose of this section is to prove the Wigner Uniqueness propo¬ 
sition. For simplicity we work with one particle moving in one dimen¬ 
sion, but everything we do in this section generalizes in a routine way 
to more particles in more dimensions. 

In classical mechanics, the position x and momentum p of the particle 
determine its current state. The set of all possible states is the phase 
space of the particle. By Corollary [3l an ordinary distribution f{x,p) 
on the phase space is uniquely determined by its marginal distributions 
for all linear combinations ax -\- bp where a, b are not both zero. 

In the quantum case, a state of the particle is given by a normalized 
(to norm 1) vector l"^) in L^(M). The position and momentum are 
given by Hermitian operators X and P where 

{X'il)){x) = X ■ 'ip{x) and {P'il)){x) = — 

dx 

For any a, b not both zero, the linear combination z = ax + by is given 
by the Hermitian operator Z = aX + bP. In a state l"^), there is a 
probability distribution g{z) (for the measurement) of the values of z. 
For a function h{z) of z, the expectation of h{z) is {'ip\h{Z)\'ijj). 
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The following technical lemma plays a key role in our proof of the 
uniqueness of Wigner’s quasi-distribution. 

Lemma 4. 

(^|g-*(aV+/3P)|^) _ gia/3ft/2 J ^ (^y _ dy. 

Proof. We want to split the exponential into a factor with X times 
a factor with P. This is not as easy as it might seem, because X 
and P don’t commute. We have, however, two pieces of good luck. 
First, there is Zassenhaus’s formula, which expresses the exponential 
of a sum of non-commuting quantities as a product of (inhnitely) many 
exponentials, beginning with the two that one would expect from the 
commutative case, and continuing with exponentials of nested commu¬ 
tators: 

where the “• • • ” refers to factors involving double and higher commu¬ 
tators. 


Q: You gave no reference to Zassenhaus’s paper. 

A: Apparently, Zassenhaus never published this result, 
but there’s a paper [Ij that shows how to compute the 
next terms. It also has a pointer to early uses of the 
formula. 

The second piece of good luck is that [X, P] = ihl, where I is the 
identity operator. (In the future, we’ll usually omit writing / explicitly, 
so we’ll regard this commutator as the scalar ih.) Since that commutes 
with everything, all the higher commutators in Zassenhaus’s formula 
vanish, so we can omit the “• • • ” from the formula. We have 

(^|g-iaX-i/3P|^) ^ ^^|g-iaXg-i/3Pga/3[X,P]/2|^^_ 

The last of the three exponential factors here arose from Zassenhaus’s 
formula as 

— -[—iaX,—i(3P] = -a(3[X,P] = ia(3h/2. 

That factor, being a scalar, can be pulled out of the bra-ket. Taking 
into account ^2.31 

(^|g-i(aV+/3P)|^) _ ^iayh/2 f ^(^y _ dy. 


□ 
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Now we are ready to prove the Wigner Uniqueness proposition. Sup¬ 
pose that a quasi-distribution f{x,p) yields correct marginal distribu¬ 
tions for all linear combinations of position and momentum. For any 
real a, (3 not both zero, let a, b, g, ( be as in Corollary |3l Then 



By Lemma m 


^ ia^h/2 r 

(2) /(a, /3) = J - ^h) dy. 

To get f{x,p), apply the (two-dimensional) inverse Fourier transform. 

Collecting the three exponentials that have a in the exponent, and 
noting that a appears nowhere else in the integrand, perform the inte¬ 
gration over a and (recall ^2.21) get a Dirac delta function; 

J Q-iaiy-^-x) ^ 27lS{y — X — ^). 

That makes the integration over y trivial, and what remains is 

(3) P) = ^ j 

which is Wigner’s quasi-distribution. 

To check that Wigner’s quasi-distribution yields correct marginal 
distribution note that the derivation of (jS]) from ([1]) is reversible. This 
completes the proof of the Wigner Uniqueness proposition. 

5. Weyl’s correspondence 

There is another approach to characterizing the Wigner quasi- 
probability distribution, using the expectation values for a wide class 
of functions rather than the marginal distributions for just the linear 
functions of position and momentum. The main ingredient for this ap¬ 
proach is a proposal by Hermann Weyl [T^ §IV.14] for associating a 
Hermitian operator on L^(M) to any (well-behaved) function g{x,p) of 
position and momentum. Weyl’s proposal is to first form the Fourier 
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transform g{a,P) of g{x,p), and then apply the inverse Fonrier trans¬ 
form with the Hermitian operators X and P in place of the classical 
variables x and p. Thus, the Weyl correspondence associates to g{x,p) 
the operator 

g{X,P) = ^jj 

If one grants that this is a reasonable way of converting phase-space 
functions g{x,p) to operators g{X,P), then a desirable property of a 
phase-space quasi-probability distribution f{x,p) would be that the 
expectation of g{X, P) in a quantum state \'ip) is the same as the ex¬ 
pectation of g{x,p) under f{x,p). We shall show that the Wigner dis¬ 
tribution is uniquely characterized by enjoying this desirable property 
for all well-behaved g. 

Indeed, the expectation of g{X, P) in state |'0) is 

^ jj dad(3, 

and the expectation of g{x,p) under the distribution f{x,p) is 


g{x,p)f{x,p)dxdp= // g{a,f3)f{a,(3)dadf3. 


This last equation is a consequence of the fact, mentioned in §2.11 that 
the Fourier transform is a unitary operator and therefore preserves the 
inner product structure of Since these two expectations agree 

for all (well-behaved) g, 

f(a,l3) = 

But this is the part of equation([T]) that was used to derive Wigner’s 
formula ([3]). 


Q: I wonder how Weyl arrived at his proposal. 

A: Weyl presents his proposal in [19] without any moti¬ 
vation, so we don’t know how he came up with it, but we 
can speculate. The title of [I9| indicates that Weyl was 
working in a group-theoretic context. As a result, the 
Fourier transform, expressing functions on R as combi¬ 
nations of the characters of the group (R, -|-) would 
be in the forefront of his considerations. Now consider his 
goal — to somehow convert a classical function g{x,p) 
into an operator. Roughly speaking, he would want to 
substitute the operators X and P for the classical vari¬ 
ables X and p. An obvious difficulty is that the same 
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functions g{x,p) might have two different expressions, 
for example xp = px, which are no longer equivalent 
when operators are substituted, XP ^ PX. So it is 
reasonable to try to choose, from the many expressions 
for a function g{x,p), one particular, reasonably canoni¬ 
cal expression, into which one can substitute X and P. 
The Fourier expansion, f g{a, (3) exp{i{ax + (3p))dxdp 
has those properties. It depends only on the function g, 
not on how one chooses to express it, and there is no 
problem substituting X and P for x and p. 


6. Feynman and spins 


Richard Feynman studied “an analogue of the Wigner function for 
a spin I system or other two state system” [B]. He chose the z and 
X components of the spin to serve as the analogs of the position and 
momentum in Wigner’s formula. 

Q: His case should be much simpler than Wigner’s case. 

A: Not necessarily. While the commutator [X, P] is a scalar, the com¬ 
mutator of the z and x components of a spin is the ^/-component times 
ih. This is but one of several complications. 

Q: Is Feynman’s quasi-distribution determined by the correct 
marginals for all linear combinations of the x and z spins? 

A: That question sounds reasonable until you look at it a little more 
closely. To £x notation, let’s describe spin by means of the standard 
Pauli matrices 



The usual matrix representation of the spins for a spin ^ particle is 
given by these matrices divided by 2, but it’s convenient to skip those 
extra factors if you like, imagine that we are measuring angular 
momentum in units of h/2 instead of h. 

So each of our matrices has eigenvalues ±1, with -|-1 meaning spin 
along the corresponding positive axis and —1 along the corresponding 
negative axis. For example, the two basis states of spin up and spin 
down along the z axis are the eigenvectors for eigenvalues 1 and —1 of 
Z; equivalently, they correspond to eigenvalues 1 and 0 for (J -|- Z)/2, 
where I is the identity operator. This point of view is useful because 
it implies that, in any state (1 -|- {Z))/2 is the probability that 
the z spin is up. Here, as before, angle brackets denote expectations. 
Similarly, (1 — {Z))/2 is the probability that the spin is down. Of 
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course, analogous formulas apply to the x and y components of the 
spin. 

Feynman, in analogy to Wigner, introduces a quasi-probability dis¬ 
tribution / for the pair of non-commuting observables Z and X. So f 

has four components, /++, /+_, /_+, and /_, as the quasi-probability 

of Z and X having the values ±1 given by the subscripts of /. 

Now let’s look at a linear combination of Z and X, for example 
the simplest nontrivial one, Z + X. From the point of view of quasi¬ 
probabilities. 


• with probability /++, Z has value 1 and X has value 1, so Z+X 
has value 2, 

• with probability _, Z has value 1 and X has value —1, so 

Z + X has value 0, 

• with probability /_+, Z has value —1 and X has value 1, so 
Z + X has value 0, and 

• with probability /_, Z has value —1 and X has value —1, so 

Z + X has value —2. 


Altogether, the possible values oi Z + X are 2, 0, and —2, with quasi- 
probabilities /++, /+_ -|- /_+, and /_, respectively. 

So your proposed analog of the result for Wigner’s distribution would 
assume that / is chosen so that these probabilities agree, in a given 
state, with the probabilities computed by quantum mechanics. But 
such agreement is impossible, because, according to quantum mechan¬ 
ics, the possible values oi Z + X are the eigenvalues of this operator, 
namely ±\/2, which are completely different from the 2, 0, —2 arising 
from the quasi-probabilities. It is easy to check that the possible val¬ 
ues of any nontrivial linear combination of Z and X are completely 
different from the values arising from the quasi-probabilities. 


Q: OK, let’s require the minimum that Feynman obviously intended, 
namely that / should produce the correct marginal distributions of Z 
and X. Does this determine / uniquely? 


A: At hrst sight, this looks promising. Requiring the correct marginals 
for the two variables, each having two possible values, gives us four 
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equations for the four unknown components f±± of /: 

/++ + U- = i(l + (Z» 

/-+ + /- = i(l - {Z)) 

/++ + /-+ = i(l + (A')) 

A-+/-= i(i - (A». 

But there’s redundancy in the equations; only three of them are inde¬ 
pendent, so there’s one free parameter in the general solution. In fact, 
it’s easy to write down the general solution; 

/++ = ^(1 + {^) + {^) + 

A- = i(l + (Z) - (A) - () 

A+ = j(i - (Z) + (A) - () 

A- = i(l-(Z)-(A) + (), 

where t is arbitrary. Feynman’s formulas correspond to f = (Y) but 
we see no reason to prefer (Y) over, for example, —{Y). 

Q: Put the freedom in choosing t to some use. How about minimizing 
the negativity in /? In other words, adjust t to bring / as close as 
possible to being a genuine probability distribution. 

A: That idea works better than we originally expected. One can get 
rid of the negativity altogether. For each state |'0), there is a choice of 
t that makes all four components of / nonnegative. 

Indeed, write down the four inequalities f±± > 0 using the formulas 
above for these /±±’s. Solve each one for t. You find two lower bounds 
on t, namely 

-1 - (Z) - {X) (from /++ > 0) 

-1 + {Z) + {X) (from /__ > 0), 
and two upper bounds, namely 

1 + {Z)- {X) (from /+_ > 0) 

I-{Z) + {X) (from /_+ > 0). 

An appropriate t exists if and only if both of the lower bounds are less 
than or equal to both of the upper bounds. That gives four inequalities. 
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which simplify to —1 < (Z) < 1 and —1 < {X) < 1. But these are 
always satisfied, because the eigenvalues of Z and X are ±1. 

Q: I am confused. The uncertainty principle asserts that you cannot 
measure Z and X at once. Accordingly one would expect that the joint 
probability distribution f±± should not exist or, as in Wigner’s case, 
should have at least one negative value. 

A: The relevant difference between Wigner’s and Feynman’s cases 
seems to be this. In Wigner’s case, there is naturally a rich set of 
marginals that the joint probability distribution is supposed to pro¬ 
duce, namely the probability distributions of all linear combinations of 
the position x and the momentum p of the particle. In Feynman’s case, 
the natural set of marginals is too poor, just the probability distribu¬ 
tions of Z and X. 

Q: Did Feynman find a good use for quasi-probabilities? 

A: He introduced negative probabilities in connection to a problem of 
infinities in quantum field theory. “Unfortunately I never did find out 
how to use the freedom of allowing probabilities to be negative to solve 
the original problem of infinities in quantum field theory!” [6]. 

Q: Still, the idea to use quasi-probability distributions to simplify in¬ 
termediate computations looks attractive to me. 

A: You are in good company. 
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